Picture for Qin Liu

Qin Liu

University of California Davis

Human-LLM Collaborative Feature Engineering for Tabular Data

Add code
Jan 28, 2026
Viaarxiv icon

FRIEDA: Benchmarking Multi-Step Cartographic Reasoning in Vision-Language Models

Add code
Dec 08, 2025
Viaarxiv icon

ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation

Add code
Oct 09, 2025
Figure 1 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 2 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 3 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 4 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Viaarxiv icon

False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize

Add code
Sep 04, 2025
Figure 1 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 2 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 3 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 4 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Viaarxiv icon

QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA

Add code
Jun 09, 2025
Figure 1 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 2 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 3 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 4 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Viaarxiv icon

Exploring Scaling Laws for EHR Foundation Models

Add code
May 29, 2025
Figure 1 for Exploring Scaling Laws for EHR Foundation Models
Figure 2 for Exploring Scaling Laws for EHR Foundation Models
Figure 3 for Exploring Scaling Laws for EHR Foundation Models
Figure 4 for Exploring Scaling Laws for EHR Foundation Models
Viaarxiv icon

RAP: Runtime-Adaptive Pruning for LLM Inference

Add code
May 26, 2025
Figure 1 for RAP: Runtime-Adaptive Pruning for LLM Inference
Figure 2 for RAP: Runtime-Adaptive Pruning for LLM Inference
Figure 3 for RAP: Runtime-Adaptive Pruning for LLM Inference
Figure 4 for RAP: Runtime-Adaptive Pruning for LLM Inference
Viaarxiv icon

MetaScale: Test-Time Scaling with Evolving Meta-Thoughts

Add code
Mar 17, 2025
Viaarxiv icon

Referring to Any Person

Add code
Mar 11, 2025
Figure 1 for Referring to Any Person
Figure 2 for Referring to Any Person
Figure 3 for Referring to Any Person
Figure 4 for Referring to Any Person
Viaarxiv icon

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Add code
Feb 22, 2025
Figure 1 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 2 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 3 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 4 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Viaarxiv icon